Article (refereed) -postprint Tagging of Environmental Data Using a Novel Skos Formatted Environmental Thesaurus [in Special Issue: Semantic E-sciences] Earth Title: Automated Tagging of Environmental Data Using a Novel Skos Formatted 2 Environmental Thesaurus. 3 4
نویسندگان
چکیده
The NERC and CEH trademarks and logos ('the Trademarks') are registered trademarks of NERC in the UK and other countries, and may not be used without the prior written consent of the Trademark owner. Abstract 18 There is increasing need to use the widest range of data to address issues of environmental 19 management and change, which is reflected in increasing emphasis from government 20 funding agencies for better management and access to environmental data. Bringing 21 together different environmental datasets to confidently enable integrated analysis requires 22 reference to common standards and definitions, which are frequently lacking in 23 environmental data, due to the broad subject area and lack of metadata. Automatic 24 inclusion within datasets of controlled vocabulary concepts from publicly available standard 25 vocabularies facilitates accurate annotation and promotes efficiency of metadata creation. 26 To this end, we have developed a thesaurus capable of describing environmental chemistry 27 datasets. We demonstrate a novel method for tagging datasets, via insertion of this 28 thesaurus into a Laboratory Information Management System, enabling automated tagging 29 of data, thus promoting semantic interoperability between tagged data resources. Being web 30 available, and formatted using the Simple Knowledge Organisation System (SKOS) 31 semantic standard, this thesaurus is capable of providing links both to and from other 32 relevant thesauri, thus facilitating a linked data approach. Future developments will see 33 extension of the thesaurus by the user community, in terms of both concepts included and 34 links to externally hosted vocabularies. By employing a Linked Open Data approach, we 35 anticipate that Web-based tools will be able to use concepts from the thesaurus to discover 36 and link data to other information sources, including use in national assessment of the extent 37 and condition of environmental resources. 38 39 40 41
منابع مشابه
EARTh: An Environmental Application Reference Thesaurus in the Linked Open Data cloud
The paper aims at providing a description of EARTh, the Environmental Application Reference Thesaurus. It represents a common general thesaurus for the environment, which has been published as a SKOS dataset in the Linked Open Data cloud. It promises to become a core tool for indexing and discovery environmental resources by refining and extending GEMET, which is considered the de facto standar...
متن کاملEnvironmental Data Store: Design And Implementation
In this paper we present the design and implementation of the Environmental Data Store (EDS). We also highlight the Environmental Thesaurus Server (EnvThs), a controlled vocabulary service application developed by us, providing semantic support on submission and search within EDS. With the rapid growth in data volumes, data diversity and data demands from multi-disciplinary research effort, dat...
متن کاملTheSoz: A SKOS Representation of the Thesaurus for the Social Sciences
The Thesaurus for the Social Sciences (TheSoz) is a Linked Dataset in SKOS format, which serves as a crucial instrument for information retrieval based on e.g. document indexing or search term recommendation. Thesauri and similar controlled vocabularies build a linking bridge for other datasets from the Linked Open Data cloud even between different domains. The information and knowledge, which ...
متن کاملONKI-SKOS – Publishing and Utilizing Thesauri in the Semantic Web
Thesauri and other controlled vocabularies act as building blocks of the Semantic Web by providing shared terminology for facilitating information retrieval, data exchange and integration. Representation and publishing methods are needed for utilizing thesauri efficiently, e.g., in content indexing and searching. W3C has provided the Simple Knowledge Organization System (SKOS) data model for ex...
متن کاملEnvThes - interlinked thesaurus for long term ecological research, monitoring, and experiments
The long term ecological research and monitoring is resulting in a vast amount of data describing environmental characteristics, drivers and pressures. Despite harmonisation efforts in the field of methods and observation designs within the frame of LTER Europe or other related networks, different management solutions, together with a varying set of terms and concepts describing the data, are o...
متن کامل